Multilingual Authoring Using Feedback Texts

نویسندگان

  • Richard Power
  • Donia Scott
چکیده

There are obvious reasons for trying to automate the product ion of multilingual documentation, especially for routine subject-matter in restricted domains (e.g. technical instructions). Two approaches have been adopted: Machine ~lh'anslation (MT) of a source text, and Multilingual Natural Language Generation (M-NLG) from a knowledge base. For MT, information extraction is a major difficulty, since the meaning must be derived by analysis of the source text; M~NLG avoids this difficulty but seems at first sight to require an expensive phase of knowledge engineering in order to encode the meaning. Wc introduce here a new technique which employs M-NLG during the phase of knowledge editing. A 'feedback text', generated from a possibly incomplete knowledge base, describes in natural language the knowledge encoded so far, and the options for extending it. This method allows anyone speaking one of the supported languages to produce texts in all of them, requiring from the author only expertise in the subject-matter, not expertise in knowlo edge engineering. 1 I n t r o d u c t i o n The product ion of multilingual documentat ion has an obvious practical importance. Companies seeking global markets for their products must provide instructions or other reference materials in a variety of languages. Large political organizations like the European Union are under pressure to provide multilingual versions of official documents, especially when communicating with the public. This need is met mostly by human translation: an author produces a source document which is passed to a number of other people for translation into other languages. Human translation has several well-known disadvantages. It is not only costly but timeconsuming, often delaying the release of the product in some markets; also the quality is uneven and hard to control (Hartley and Paris, 1997). For all these reasons, the product ion of multilingual documentat ion is an obvious candidate for automation, at least for some classes of document. Nobody expects that automat ion will be applied in the foreseeable future for literary texts ranging over wide domains (e.g. novels). tIowever, therc is a mass of non-literary material in restricted domains for which automation is already a realistic aim: instructions for using equipment are a good example. The most direct a t tempt to automize multilingual document production is to replace the human translator by a machine. The source is still a natural language document written by a human author; a program takes this source as input, and produces an equivalent text in another language as output . Machine translation has proved useful as a way of conveying roughly the information expressed by the source, trot the output texts are typically poor and over-literal. The basic problem lies in the analysis phase: the program cannot extract from the source all the information that it needs in order to produce a good output text. This may happen either because the source is itself poor (e.g. ambiguous or incomplete), or because the source uses constructions and concepts that lie outside the program's range. Such problems can be alleviated to some extent by constraining the source doeument, e.g. through use of a 'Controlled Language' such as AECMA (1995). An alternative approach to translation is that of generating the multilingual documents fl'om a non-l inguis t ic source. In the case of automatic Multilingual Natural Language Generation (M-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ITRI - 98 - 05 Multilingual authoring using feedback texts Richard Power and Donia

There are obvious reasons for trying to automate the production of multilingual documentation, especially for routine subjectmatter in restricted domains (e.g. technical instructions). Two approaches have been adopted: Machine Translation (MT) of a source text, and Multilingual Natural Language Generation (M-NLG) from a knowledge base. For MT, information extraction is a major difficulty, since...

متن کامل

Open architecture for multilingual parallel texts

1. Abstract Multilingual parallel texts (abbreviated to parallel texts) are linguistic versions of the same content (" translations "); e.g., the Maastricht Treaty in English and Spanish are parallel texts. This document is about creating an open architecture for the whole Authoring, Translation and Publishing Chain (ATP-chain) for the processing of parallel texts. • Public discussion: with an ...

متن کامل

Symbolic Authoring for Multilingual Natural Language Generation

We describe the symbolic authoring facilities of the M-PIRO project. M-PIRO is developing technology that allows personalized multilingual object descriptions, in both textual and spoken form, to be produced from symbolic information in a database and small fragments of text. The technology is being tested in the context of electronic museums, where a prototype that produces dynamically multili...

متن کامل

Webtran Tools for In-company Language Support

Webtran tools for authoring and translating domain specific texts can make the multilingual text production in a company more efficient and less expensive. The tools have been in production use since spring 2000 for checking and translating product article texts of a specific domain, namely an in-company language in sales catalogues of a mail-order company. Webtran tools have been developed by ...

متن کامل

MuTUAL: A Controlled Authoring Support System Enabling Contextual Machine Translation

The paper introduces a web-based authoring support system, MuTUAL, which aims to help writers create multilingual texts. The highlighted feature of the system is that it enables machine translation (MT) to generate outputs appropriate to their functional context within the target document. Our system is operational online, implementing core mechanisms for document structuring and controlled wri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998